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INTRODUCTION 

The chromosomal DNAs of Escherichia coli and 
Salmonella typhimurium share many properties. A 
single, continuous, negatively supercoiled molecule 
is folded into a nucleoid structure that bends at 
specific sequences and forms nucleosomelike struc- 
tures by interaction with histonelike protein mole- 



cules (for rninireviews. see references 27, 69, and 
70). 

Extensive electrophoretic characterizations of al- 
lozymes from more than 1,600 isolates of E. coli have 
revealed "strong multilocus associations" (86, 87); in 
other words, there are only a few clonal types of £- coli 
bacteria, and there seems to be little genetic exchange 
among these enteric organisms in their natural envi- 
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ronments. Despite the stability of the genomes of 
these contemporary organisms, their chromosomes, 
like those of other organisms, have changed during 
evolutionary time. In this chapter, detailed compari- 
sons of the sequence of loci in £ coli and S. typhimu- 
rium are presented, and the means by which chromo- 
somal material in these organisms can be acquired 
and rearranged are discussed 

CONSERVED FEATURES 
Size and Complexity 

Physical measurements 

The near equivalence of £. coli and S typhimurium 
genome sizes is evident when they are compared with 
the range of sizes that occur among procaryotes (6, 
9) The smallest procaryotic genome, 04 x 10 
da! tons, occurs in Chlamydia trachomatis, and the 
largest, 8.6 x 10 9 daltons, occurs in Calothrix spp 
(45). The genomes of E. coli and S. typhimurium 
form a cluster within this spectrum. By relating the 
genetic map distances with physical measurements 
of size of the corresponding DNA, the conversion 
factor of 45 to 46'kilobases (kb) per map unit has 
been obtained for two different, extensive regions of 
the E. coli K-12 chromosome. If these regions are 
representative of the whole, and if all map units are 
comparable, this conversion factor yields a value of 
3.0 x 10 9 daltons for the 100-map-unit chromosome 
of £, coli K-12 (11,41) 



Renaturation 

Approaching the characterization of bacterial DNA 
from another direction, the degree of complexity of 
the DNA has been estimated from second-order rena- 
turation kinetics (14, 1 17). Values were obtained that 
ranged from approximately 3.8 x 10 6 base pairs (bp) 
in unique sequences for £ coli C (13, 37) through a 
value of 4 J x 10 6 bp for an isolate of S. typhimurium 
Extensive data indicate that the overall complexities 
of Salmonella strains are generally 10 to 20% greater 
than that of £ coli K-12 (24). Despite this level of 
variability, some of the values obtained for some of 
the strains wer e nearly identical for the two bacteria. 
For example, the complexity of E. coli K-12 was 
determined to be 4,0 x 10* bp, and that of S. typhimu- 
rium LT7 was 4,2 x 10 6 bp It may be noted that these 
complexities refer to the entirety of the complements 
of unique sequences of DNA in the cell, the sum of 
both plasmid and chromosomal elements. The pres- 
ence of a plasmid in one organism and the absence of 
an equivalent structure in a second organism could 
contribute to greater or lesser levels of perceived 
similarity in complexity. 

Besides unique sequences, part of the genomic DNA 
is in the form of repeated sequences Experiments on 
the fast-renatur ing fraction of the DNA have revealed 
that 0.5 to 2% of the sequences in the £ coli genome 
may be "foldback" DNA, or closely linked reiterated 
sequences (21, 61, 71) 

In sum, the results indicate that there is some 
variation in the measure complexities of genomes in 
any one species but that the sizes of genomes in 



strains representing both £ coli and S. typhimurium 
are approximately equivalent. 

Base Composition 

Uniformity 

The base compositions of bacterial DNAs differ one 
from another, but the chromosomal DNA of any one 
bacterial strain is relatively uniform in composition 
throughout its genome. The width of the distribution 
curve for DNA fragments that have been centrifuged 
to equilibrium in CsCl gradients is a function of the 
size distribution of the fragments and of the extent of 
heterogeneity in base composition Judging by the 
width of the profiles of equilibrium bands of bacterial 
DNAs, the guanine-plus-cy tosine (GC) contents cluster 
closely about distinct mean values (95) Because un- 
usually broad CsCl DNA profiles have not been ob- 
served, extremes of base composition must be con- 
fined to nucleotide sequences that are shor t relative to 
the lengths of shear fragments of chromosomal DNA 
(>10 6 bp), and the majority of DNA shear fragments 
must possess GC contents that are close to the chro- 
mosomal average 

Most S typhimurium strains are I to 4% higher in 
GC content than are most £. coli strains (103) When 
the GC contents of both £ coli K-12 and 5. typhimu- 
rium LT2 were measured in the same series of exper- 
iments by determining either the thermal denatur- 
ation temperature or the buoyant density in CsCl, the £. 
coli K-12 DNA was found to be 1 .5 to 2.0% lower in GC 
content than was S typhimurium LT2 DNA (7, 77, 99). 

Bias in third position of codon 

There appears to be a mechanism that conserves the 
GC content of the genome of a given bacterial strain 
within a relatively narrow range. The explanation of 
how the GC content of bacterial DNA is established, 
maintained, or modified is not yet at hand, but a part 
of the mechanism seems to lie in the selection of 
synonymous codons having suitable GC content Com- 
parison of the nucleotide sequences of corresponding 
genes in £- coli and S. typhimurium, such as genes of 
the trp operon, supports this view. In both of these 
bacteria, the trp structural genes are higher in GC 
content than the average value for the respective 
genomes. Each of the S typhimurium genes has a 
higher GC content relative to each corresponding E> 
coli gene (Table 1) Almost half of all nucleotide 
differences between the £ coli and S. typhimurium trp 
genes occur in the third positions of codons Most of 
the nucleotide differences in the genes of £ coli versus 
the genes of S. typhimurium generate synonymous 
codons. Selective codon usage clearly can be a tool for 
establishing or adjusting GC content 

When one examines GC content at higher levels of 
organization, e.g., contrasting structural genes with 
regulatory regions, heterogeneity in base composition 
is revealed. On the one hand, some regulatory se- 
quences are AT rich (e.g., control sequences for the Ipp 
and ompF genes [56, 79] and noncoding sequences 
upstream of leuABCD in both £. coli and S. typhimu- 
rium [43]) On the other hand, among individual 
structural genes some are higher in GC content and 
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TABiE ) GC content of trp structural genes of £ coli 
K-12 and S. typhimurium 1T2 



GC content {%) 



Genes (reference) 

jrpA codons (80) 
All positions 
Third positions 

trpB codons (23) 
All positions 
Third positions 

trpG{D) (81) 
All positions 
Third positions 

irpE {120) 
All positions 
Third positions 



E coli K-12 



S typhimurium LT2 



54 
56 

54 
60 

55 
61 

54 
55 



57 
63 

57 
64 

57 
62 

58 
62 



some are lower in GC content than the chromosomal 
average. These departures from the average affect 
genetic entities that are small (<10* dal tons) relative 
to the entire chromosome, suggesting that the mech- 
anisms that determine GC content of the genome are 
able to sense localized heterogeneity of a gene or an 
intercistronic region and to determine the degree ot 
compensation that is needed to counterbalance any 
small aberrant regions. The mechanism that a cell 
possesses for sensing GC content and for invoking 
compensatory adjustments has yet to be discerned. 



Gene Order 

Coincidence of loci 

Genetic maps of related organisms tend to be simi- 
lar (62) Sanderson (96) noted that there are many 
similarities in the genetic maps of E coli and 
typhimurium. More genes have been mapped in botii 
organisms since that time, and it remains true that 
the two maps are congruent, A previous exercise in 
map comparison (94) has been brought up to date here 
by using the most recently compiled map information 
(5 97) (see Fig. 1) There are some exceptions with 
respect to gene order and distances between markers 
that will be discussed below, but, in general terms, a 
remarkable degree of congruence has survived in the 
genomes of two organisms which are believed to have 
diverged from one another many millions of years 
ago, and the DNAs of which have replicated indepen- 
dently of each other many millions, perhaps billions, 
of times. 

Since £. coli and S. typhimurium are able to engage 
in sexual recombination with each other only at very 
low frequency, it is doubtful that the need to preserve 
opportunities for homologous recombination oyer 
long linkage distances has served as a stabilizing 
effect on gene order. Instead, the conservation of gene 
order implies that there is a functional correlate of 
this gene order that confers a growth or survival 
advantage on the organism Even in an organism as 
far removed from the enteric bacteria as Bacillus 
subtilis, the arrangement and function of a set of 
genes, those near the origin of replication, exhibit 
important similarities with the corresponding genes 
in E. coli (88), suggesting that function and gene 



location are not entirely independent for this set of 
genes. 

Chromosomal rearrangements 

Whatever the (unknown) advantage of a particular 
gene order may be, it is not so powerful that it 
prevents survival of some £. coli or S typhimurium 
variants that have undergone rearrangement of parts 
of their genomes In fact, a large inversion of approx- 
imately 15% of the genome has occurred in the evolu- 
tionary past in either £ coli or S. typhimurium with no 
known ill effect (19). There are other instances m 
which variants have arisen spontaneously, having 
sustained rearrangements within their genomes^In- 
version of 17% of the genome in £. coli W2637 and its 
descendant, W31 10, arose spontaneously as a result ot 
recombination between two nearly identical rrn loci 
in the £. coli genome (50). Another large inversion was 
discovered in the lac region of £. coli SY99, apparently 
produced by recombination between two copies of IS3 
that flank the inverted segment (98). In addition to 
inversions, duplications also arise spontaneously in 
laboratory strains. In the course of a study of the recB 
and recC genes, a duplication in the thyA-argA region 
was discovered in £ coli KL399 (29), These examples 
of spontaneous changes in map locations of chromo- 
some segments and of individual genes show that even 
though the order of most genes in the chromosomes of 
£. coli and S. typhimurium has been preserved, change 
is not excluded altogether, Rearrangements seem to 
be discouraged in the sense that they are rare events, 
yet they are not completely forbidden. 

In the laboratory, selections have been designed 
with an eye 'to detecting rearrangement mutations 
(e.g., references 51, 100). Inversion mutants have been 
isolated, and characterization has shown that there 
are constraints on the location of the endpoints ot 
inversions that seem to relate in some way to the 
spatial relationship between the origin and the termi- 
nus of replication. When selection for a rearrange- 
ment involving the /lis operon was exerted, inversion 
mutants were obtained, although at much lower 
frequency than duplications. Endpoints of the inver- 
sions were not located at random, but were such as 
to bring the terminus of replication closer to the 
origin (100). . . , , 

Integrity of the terminus of replication in the chro- 
mosome has been shown to be important for the 
health of the cell Deletion of 340 kb of DN A from the 
terminus region of the chromosome interfered with 
control of the travel of replication forks and caused 
poor growth and reduced viability (44). When a plas- 
mid was integrated into the chromosome near the 
terminus of replication, some £. coli strains suffered 
physiological damage and became sensitive to growth 
on rich medium (75) Revertants resistant to rich 
medium were isolated, and some of these were found 
to have undergone major chromosomal rearrange- 
ments: inversions of approximately half of the chro- 
mosome along the axis of the origin and terminus ot 
replication. The phenotypic effects on the rate ot 
growth in rich medium seem to be the consequence ot 
altered velocities of replication forks (75) Thus, at 
least some chromosomal rearrangements can be 
shown to confer a growth disadvantage on the orga- 
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nism, providing a partial explanation for the tendency 
to conserve global gene order 

Secondary Structure 

Secondary structures in the transcripts of structural 
genes are important features of mechanisms for initi- 
ating, terminating, and regulating gene expression. 
Comparison of regulatory sequences of analogous 
genes in £ coli and S typhimurium has shown that, in 
regulatory regions, secondary structures are well con- 
served even when base sequence has undergone 
change. Despite nucleotide sequence variation, the 
alternate stenvand-loop structures of the attenuation 
regions of the trp operon were conserved in a group of 
enteric bacteria, among them £. coli and S. typhimu- 
rium (I 19) Similarly, sequences at the 3' end of the trp 
operon in £ coli and S. typhimurium are not highly 
conserved in base sequence, but the stem-and-loop 
structure for the transcript terminator has been re- 
tained (81). The sequences and secondary structures of 
the leader sequences of the ilvCEDA operons of £. coli 
and S. typhimurium are highly conserved (42); Also, 
the inverted repeat sequences that provide the poten- 
tial for stem-and-loop structures in transcripts of 
intercistronic regions in the his operon and in the 
malB region have been highly conserved (36, 46). 

PATTERNS OF REPEATED SEQUENCES 

Recurrent sequences, which occur at different orga- 
nizational levels, appear to be significant in influenc- 
ing the structure of the genome . There are gene dupli- 
cations, short duplicated segments within genes, large 
(>l-kb) directly repeated or inverted sequences, and 
small (<50-bp) repeated sequences that are located 
both within and between coding sequences. 

Gene Duplication 

Multicopy genes 

rrn loci (which commonly have the sequence pro- 
moter t -promoter 2 -16S rRNA-tRNA-23S rRMA-5S 
rRNA^tRNA-terminatori-terminator}) occur at sever- 
al sites on the chromosomes. Both £. coli and S. 
typhimurium have seven rrn loci which are present at 
equivalent positions in the two genomes (66) Further- 
more, the left-to-right orientations of the loci in the 
chromosomes in these two enteric bacteria are the 
same. Whereas the overall structure of these loci is 
highly conserved, differences based on the identity of 
tRNA in the spacer region have been recognized. In 
particular, four loci in both £ coli and S, typhimurium 
contain tRNA c,u , and three contain tRNAf{? and 
tRNA I,e . Interspecific comparisons reveal that rrn loci 
at equivalent positions in the genome do not necessar- 
ily contain the same tRNA In £. coli, rrrtD contains 
tRNAfy 3 ; in S. typhimurium, the rrriD locus contains 
tRNA? Tu (66). The reciprocal relationship exists in 
rrnB; in £. coli this locus contains tRNA c,u , whereas in 
S, typhimurium tRNAfl? is present (66). Because of the 
high degree of homology among the rrn loci, one 
might expect frequent exchanges of segments among 
rrn loci. In fact, rearrangements are rare, and the 
organizations of these loci are remarkably stable. 



Duplicated genes 

Sequence similarities (homologies) are evident in 
the nucleotide sequences of many pairs or groups of 
structural genes or in the amino acid sequences of 
their protein products. Such genes are believed to be 
related by duplication of an ancestral gene followed 
by divergence of sequence and function The most 
extensively documented examples pertain to biosyn- 
thetic operons and to the sensory transducers that 
modulate responses to attractants and repellants in 
the environment Pairs and groupings of structural 
genes which have extensive homologies are listed in 
Table 2. Selected examples will be used here to illus- 
trate some general features of duplicated genes. 

Ornithine and aspartate carbamoyl transferases. 
The genes for the enzymes ornithine carbamoyltrans- 
ferase and aspartate carbamoyl transferase in £ coli 
exhibit levels of homology that are to be expected 
either between recently duplicated genes or between 
ancestrally related genes (53, 114). Ornithine carba- 
moyl transferase is specified by two loci in £ coli K-12, 
argl located at map position 97 and argF at position 7 
(5). A total of 781% of the nucleotides in the two genes 
are identical, and 86% of the 333 amino acids of the 
enzymes are identical. It seems likely that these two 
genes arose recently by duplication followed by diver- 
gence and transposition. 

The argl locus for ornithine carbamoyl transferase is 
linked to the pyrB locus, which specifies aspartate 
carbamoyltransferase. This latter enzyme contains 
310 amino acids. By introducing gaps, the codons in 
pyrB can be aligned with the nucleotide sequences for 
ornithine carbamoyltransferase. Such a comparison 
r eveals that pyrB is 35 to 40% homologous to both argl 
and argF (53, 1 14). Among these three genes, pyrB has 
diverged from argF and argl more than the latter two 
have from each other The pattern of divergence sug- 
gests that the duplication event that separated the 
ornithine carbamoyltransferase and aspartate carba- 
moyltransferase genes preceded the duplication that 
generated the two genes for ornithine carbamoyltrans- 
ferase. 

Chemoreceptors. In £. coli, four loci are known 
which specify chemoreceptors {tar, tap, tsr, and trg). 
Just as for the arg and pyr genes discussed above, the 
loci that are genetically linked (tar and tap) are less 
related to one another than either is to the unlinked tsr 



TABLE 2. Sets of closely related genes presumed to have 
arisen by duplication of an ancestral gene 



Genes Rcfcrcnce(s) 



argF, argl, pyrB . - 53. 114 

aroF. aroC 25, 102 

hisJ, argT ... 47 

ilvBN, ilvGM, iMH 35. 106, 1 16 

lysC, metl, thrA . . . 18, 34 

metB, metC 8 

ompC, ompf, phoE ... 56. 78, 89 

tar, tap, tsr, trg 10, 12, 63 

trp(C)D,pabA 52,60 

trpE, pabB . 39. 82 

tufA, tufB. fus 2, 121, 123 

sucB,aceF . . 105. 107 

xap, desD 17 
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locus (63 f 115). (The fourth locus, trg t is even less 
related [10].) Comparison of the chemoreceplor se- 
quences reveals that levels of homology are not uni- 
form throughout the length of a locus. The protein 
products have been divided into six domains; some 
are common to the various proteins (e.g., a rnethyla- 
tion region), and others are distinct in each protein 
(e g., attractant recognition sites). The distinctive por- 
tions are concentrated in the amino end of the protein, 
whereas the common regions tend to be at the carboxy 1 
end. As a consequence, levels of amino acid sequence 
similarity are low (10 to 60%) at the amino end and 
high (60 to 100%) at the carboxyi end. 

It should be noted that common function in a 
protein does not necessarily signify nucleotide homol- 
ogies in the corresponding genes Among the various 
sensory-transducing proteins, there is little homology 
among the membrane-spanning segments The low 
level of identity may reflect a low level of specificity 
required for the function. The membrane-spanning 
section just precedes the region of high homology. 
Presumably, some functionally important segments of 
ancestral genes have been conserved while other por- 
tions with more flexible requirements have diverged. 

Cystathionine-7-synthase and p-cystathionase. The 
unlinked genes metB and metC encode adjacent enzymes 
in the pathway of synthesis of methionine. Nucleotide 
sequences have been determined, amino acid sequences 
have been deduced, and sequences have been com- 
pared (8). Sequence similarities are evident, distrib- 
uted throughout both the genes and the proteins, sug- 
gesting that sequential enzymes in a metabolic pathway 
could have arisen by gene duplication and divergence. 

Aspartokinase-homoserine dehydrogenase. In E. 
coli, the thrA and metL loci specify, respectively, the 
bifunctional isozymes aspartokinase I-homoserine de- 
hydrogenase I and aspartokinase II-homoserine dehy- 
drogenase II (34), The tetrameric enzymes have three 
domains; one at either end of the polypeptide chain 
for each of the two catalytic activities and a central 
domain concerned with subunit interaction The two 
bifunctional enzymes have similarities of structure 
and sequence that suggest strongly that the two genes 
arose by duplication of an ancestral bifunctional gene 
(34) 

Unlike the aspartokinase products of metL and thrA, 
the aspartokinase specified by a third locus, /ysC, is 
not bifunctional. Nonetheless, the amino acid se- 
quence of this protein exhibits strong homology with 
the aspartokinase domains, the subunit interaction 
domains, and the proximal por tion of the homoserine 
dehydrogenase domains of the aspartokinase-homo- 
serine dehydrogenase enzymes I and II. The evidence 
suggests that the lysC locus originated from an ances- 
tral gene that also served as the progenitor of met and 
thrA (18) Presumably, in subsequent evolution of the 
genome, segments required for the homoserine dehy- 
drogenase activity were deleted from the lysC locus, 
retained in metL and thrA^ 

Internal Sequence Repetitions 

Aspartokinase-homoserine dehydrogenase 

The nucleotide sequences specifying some proteins 
have small internal segments that are repeated within 



the coding region of the locus. The thrA and metL 
genes are not only examples of whole-gene duplica- 
tions, as discussed above, but they serve as examples 
of internal duplications as well (34). From the nucle- 
otide sequence of thrA, an 820-amino-acid sequence 
can be inferred which would constitute the primary 
structure of aspartokinase I-homoserine dehydrogen- 
ase 1. Likewise, a protein with 809 residues can be 
derived for aspartokinase II-homoserine dehydrogen- 
ase II. When residues 9 to 128 and 146 to 263 of 
enzyme I are compared, 22% of the nucleotide resi- 
dues are identical If residues are scored as matches 
when no change results in the amino acids specified, 
the similarity increases to 39%, and if "acceptable" 
amino acid substitutions are allowed (namely, isoleu- 
cine ■ leucine = valine, serine = threonine, phenylal- 
anine ~ tyrosine, arginine = lysine, and aspartate = 
glutamate), the relatedness increases to 44%. These 
sequences can be viewed as repetitions which oc- 
curred within the amino- terminal part of the thrA 
gene and which have diverged over time. A similar 
analysis of the carboxy end of the gene shows another 
repetition. Furthermore, one of the repeated segments 
specifying the carboxy end of the protein has a nucle- 
otide sequence that recurs in the sequence of nucleo- 
tides that separates the two large repeats Thus, the 
deduced amino acid sequences indicate that both the 
amino end and the carboxy end of the molecule 
contain duplicated segments. Analysis of asparto- 
kinase II-homoserine dehydrogenase II reveals that it 
has a similar organization 

Carbamoylphosphate synthetase 

The carB locus of E coli specifies a subunit of 
carbamoylphosphate synthetase. Examination of the 
3,219-nucleotide sequence that specifies the sequence 
of 1,072 amino acids in the yV-formylmethionine-de- 
pleted protein indicates that there is a high level of 
homology between the amino end and the carboxy end 
of the enzyme. In particular, when the first 400 resi- 
dues of the protein are compared with residues 553 to 
933 (and adjustments are made for deletions and 
insertions), 39% of the amino acids are identical. 
When acceptable amino acid substitutions are scored, 
the relatedness rises to 64% Also, when residues 401 
to 552 and 933 to 1072 are compared, identities of 20% 
and relatedness of 45% are seen. Accordingly, the 
evidence suggests that the coding sequence for this 
enzyme may also have arisen from internal duplica- 
tions (85). 

Internal repetitions in thrA, thrB, and thrC products 

The genes of the thr biosynthetic operon share short 
internal sequences. For the most part, the three genes 
for biosynthesis of threonine, thrA, thrB t and thrC, are 
unrelated, as are the bulk of the amino acid residues 
in the corresponding enzymes. However, there is a 
35-aminoacid region that appears twice in the thrA 
enzyme, once in the thrB enzyme at a location com- 
parable to one of the thrA positions, and once in the 
thrC enzyme at a location comparable to the other 
position in thrA. These sequence similarities could 
reflect some common functional feature of the three 
enzymes (90) 
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Large (>l-kb) Repeated Sequences 
£. coli and S typhimurium chromosomes harbor 
sets of repeated sequences that contribute to rear- 
rangement of the genetic material. Repeated se- 
quences flanking unique sequences have been identi- 
fied which bear a structural relationship to insertion 
sequences and transposon organizations and which 
may bear a functional relationship as well 

rm loci 

Chromosomal rearrangements may come about by 
unequal crossing over between sister chromatids; the 
specific sites of crossing over between the sister chro- 
matids can be any repeated sequence. The recurrence 
of the seven rrn loci in E. coli and S. typhimurium 
serves as the basis of the formation of a set of dupli- 
cations and deletions (49, 67, 68), transpositions (51), 
and inversions (50). Duplications and deletions pre- 
sumably involve rrn loci with the same orientation, 
inversions involve rm loci with opposite orientations, 
and transpositions, depending on their character , may 
involve either orientation. 

Hill and collaborators (49) hypothesized that, as a 
consequence of crossing over between reiterated rm 
loci a variety of rare, covalently closed, circular DNAs 
should be excised from the chromosome of £. coli 
mutants having duplications in the glyT region. Such 
molecules with the predicted contour lengths were 
isolated. R-loop analyses demonstrated the expected 
numbers and positions of rrn loci in these molecules. 
These results provided substantive evidence that rrn 
loci served as the basis of deletion of duplicated 
segments of the genome The same reasoning indi- 
cates that the duplications arose originally through 
unequal crossing over between rrn loci. Similar obser- 
vations have been made with S. typhimurium (67). 
Furthermore, inversion (50) and transposition (51) ot 
segments of the £. coli genome bounded by rrn loci 
also have been demonstrated. 

Genetic evidence has been acquired which substan- 
tiates that rrn loci are associated with duplications . In 
. S. typhimurium, detection of duplications at 38 loci 
has been achieved by first introducing Tn/0 into 
specific loci (thus creating auxotrophs), then introduc- 
ing wild-type alleles by transduction, and finally se- 
lecting for both prototrophy (a feature of the wild-type 
allele) and tetracycline resistance (a feature of the 
Tn/0 insertion in the mutated allele) (3) Mapping the 
extents of duplications in isolates recovered from such 
selection regimes revealed duplications bounded by 
rm loci . Comparisons of recipients transduced to 
prototrophy alone versus transduction to prototrophy 
plus antibiotic resistance revealed that duplications 
are commonplace. Specifically, the segment of the 
genome bounded by the rrnB and miE loci is mero- 
diploid in approximately 3% of the organisms in a 
rapidly growing population, whereas other regions 
bounded by rrn loci are duplicated with a frequency 
between 10" 4 and 10~ 3 per cell. 

Wis loci 

£ coli K-12 has at least three rhs (rearrangement 
hot-spot) loci that have highly similar nucleotide 
sequences, lack a known phenotype, and permit rear- 



rangements reminiscent of those based on rrn loci 
(72). Two of these loci, rhsA and rhsB, have been 
mapped and were determined to have the same ori- 
entation on the chromosome. The former is approxi- 
mately 140 kb clockwise from the latter; the sequence 
of intervening genes is pit-glyS-xyl. The rhs loci contain 
3,8-kb "cores" that can form SI -resistant heterodu- 
plexes Unequal sister chromatid exchange between 
rhsB and rhsA accounts for the observed high frequen- 
cy of glyS duplications. The sizes of rhs loci and the 
frequencies of rearrangements, involving these loci are 
equivalent to the sizes of and rearrangement frequen- 
cies involving rm loci . Sequences homologous with 
rhs have not been detected in S. typhimurium. 

rtl-atl/gat alternation of alleles 

The role of flanking sequences in genome organiza- 
tion has been further adduced by studying the relation 
of some genes that confer the capacity to oxidize 
polyalcoholic sugars (73) The ability to metabolize 
ribitol and D-arabitol occurs among 10 to 20% of £. 
coli strains isolated from native habitats. Such meta- 
bolic capacity exists in C strains of E, coli but not in 
either B or K-12. The latter strain is able to metabo- 
lize galactitol The genes specifying the ability to use 
ribitol or D-arabitol are immediately bounded by 
inverted, imperfectly repeated 1.4-kb sequences When 
the operons for pentitol utilization were transduced into 
strains with functional galactitol genes (gat) and select- 
ed for pentitol catabolism, the pentitol operons were 
found to have displaced the gat genes. Likewise, when 
the hexitol genes were transduced into strains with the 
capacity to metabolize ribitol or D-arabitol, the strains 
commonly lost the pentitol operons as they gained the 
gat genes (74, 118). Recombination at the flanking se- 
quences is the postulated basis for this allelic relation- 
ship. Such reciprocal exclusion is the first example in a 
monoploid organism of genuine alleles existing as alter- 
native forms having high natural frequencies. 

Insertion sequences 

Multiple copies of many kinds of insertion sequences 
are often present in the genomes of £. coli and S, 
typhimurium strains, conferring properties of move- 
ment and change Ornithine carbamoyl transferase (as 
noted earlier) is coded for by the argl and argF loci; 
the latter gene is flanked by direct repeats of IS/. 
Selection for hyperproduction of the enzyme can yield 
mutants which have the argF gene amplified as many 
as 45 times. Examination of restriction endonuclease 
digests of DNA from mutants indicates that the extent 
of the amplified sequence corresponds to the distance 
between the two IS/ elements. The amplification 
occurs only when an F factor is integrated nearby in a 
cis position Whether the reiterated genes are inte- 
grated into the chromosome or exist as an extrachro- 
mosomal element has not yet been demonstrated (57) 



Short (<50-bp) Repeated Sequences 

ampC 

Direct reiteration of the ampC gene of E. coli 
(which specifies ^-lactamase) confers resistance to 
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ampicillin in direct proportion to the number of 
gene copies Selection for ever greater levels of 
resistance allows the isolation of mutants that have 
DNA segments containing the ampC determinant 
repeated as many as 40 times on the chromosome or 
50 times on ColEl -derived plasmids (31). When the 
nucleotide sequences of the termini of repeated 
segments, as well as the novel "join point" within 
the repeated segments, were compared, they were, 
seen to share 12-bp lengths that were identical (32). 
The sequence showed no elements of symmetry. 
Analysis of other independently produced duplica- 
tions revealed that endpoints occurred at different 
positions in approximately 8 kb of chromosomal 
DNA. Statistical arguments indicate that, in a ran- 
dom nucleotide sequence, sequences having exactly 
the same order of 12 nucleotides are likely to occur 
twice in a length of 4 kb, three times in one of 6 3 kb, 
and four times in one of 8 kb. The frequency of short 
nucleotide sequences in the bacterial genome may 
be modulated such that they contribute to the 
overall organization of the genome. 

lac 

A series of compensating mutations at the begin- 
ning of the lac operon of £ coli contribute to a 
selection scheme that yields amplified segments 
(112). In this regime, the lacl and lacZ loci have been 
fused to produce a hybrid gene lacking the distal 
portion of the former and the proximal portion of 
the latter; the gene product, nonetheless, is func- 
tional If expression of the / gene is damaged, 
(3-gaIactosidase production can occur only as a con- 
sequence of translation reinitiation sites down- 
stream. When organisms containing all of these 
genetic alterations are selected for lactose utiliza- 
tion, unstable lac + isolates are obtained. Such 
mutants contain 40 to 80 copies of the altered lac 
operon on DNA segments ranging in size from 7 to 
37 kb (with the representative size ranging from 15 
to 20 kb). When the extents of amplified segments in 
133 isolates were examined by restriction endonu- 
clease mapping, some preference in the location of 
endpoints was observed. On the left side of the lac 
operon, the amplified segments originated more or 
less equally within any of three Hindi fragments 
and never or rarely originated within two other 
Hindi fragments. At the right end of the amplified 
segment, endpoints occurred in a region spanning 
14 8 kb. Of these, more than 35% occurred in adja- 
cent 0.4- or 2.6-kb Hindi fragments Thus, the 
endpoints of independent recurrences of amplified 
segments were found to be clustered but not identi- 
cal. 

Short repeated sequences have been observed in 
the lad region and are known to function in the 
formation of deletions. The most frequently used 
sequence (involved in 60% of approximately 250 
deletion mutants) was composed of 17 bp Analysis 
of the sites at which various deletion mutants orig- 
inated revealed that J 4 of the 17 bp were highly 
conserved (1). Determining whether such sequences 
also contributed to the production of lac repetitions 
awaits nucleotide sequencing of the origins, termi- 
nations, and join points of the amplified segments. 



hin, gin, cin, pin 

A site-specific inversion system resides in the S. 
typhimurium chromosome around the H2 antigen 
locus The Hin enzyme inverts a segment of about 1 
kb of DNA by directing recombination between a 
pair of flanking inverted repeated sequences of 14 
bp each (104). Closely similar site-specific inversion 
systems exist in phage Mu (gin), in phage PI (cin), 
and within a prophage in the £. coli chromosome 
(pin) (22). 

REP sequences 

Recently, "repetitive extragenic palindromic" (REP) 
sequences have been identified as major components 
in the genomes of E. coli and S. typhimurium (38, 46, 
108). In £. coli, a consensus sequence of 38 bp was 
detected in 25% of the operons examined. Extrapola- 
tion from this evidence indicates that the REP se- 
quence itself may occur as many as 500 times and may 
constitute more than 0.5% of the genome. The func- 
tion of REP sequences has not been established. They 
appear not to regulate transcription, affect transla- 
tion, promote gene expression, or function as termi- 
nators However, when inversion mutants in the his 
operon of S.. typhimurium were selected by having , 
histidine prototrophy restored through fusion of the 
inverted hisD locus to new functional promoters, 7 of 
10 inversions originated in an intercistronic REP 
sequence between hisG and hisD (100) 

hisM 

"Odd group" mutants of S. typhimurium have 
completely lost the wild-type capacity to transport 
i-histidine and have a markedly improved ability to 
transport t-histidinol. Examination of the nucleo- 
tide sequence of the hisM gene reveals a 29-bp 
region that contains multiple repeated sequences; 
the segment can be viewed as having two direct 
decanucleotide repeats separated by 2 bp- The nucle- 
otide sequence in seven separately isolated mutants 
differed from the wild-type sequence in having the 2 
bp preceding the decanucleotide repeat, plus the 
repeat, deleted. These observations establish that a 
distinct change in substrate specificity can be 
achieved by a short in-phase deletion of repeated 
codons (91). 

Significance of Repeated Sequences 

The recurrent structures that have been described 
above provide opportunities for restructuring the ge- 
netic composition of the bacterial chromosome in 
which they reside. Enzymes are present that catalyze 
either general recombination between homologous 
sequences (e.g., the RecA protein) or sequence-specific 
recombination (e g , the Hin enzyme), yet we observe 
that in nature rearrangements that change the order 
of genes in E~ coli or S. typhimurium genomes are 
rare. Much remains unknown about the interactions 
of the repeated sequences, the features about them 
that are important, and the mechanisms that regu- 
late the level of the genetic activities of the repeated 
sequence.. 
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FIG I Alignment of genetic maps of E. coli (EC) and S typhimurium (ST) . Genes that have been mapped in both £. coli (5) and 
S typhimurium (97) were aligned, starting at map position 0 Pairs of genes were not included if one of the two has been mapped 
only approximately (signified by parentheses in the data sources) In the map alignment, when a pair of genes was displaced by 0.6 
map units or more, adjustment was made to improve the alignment . Excess genetic distance in one of the maps is displayed as loops 
that balloon out from the paired regions to one side or the other. The precise position of the loops in most cases is not closely defined 
by the available genetic information Within the constraints of available genetic data, the positions of loops were chosen so as to 
minimize, where possible, the numbers of known genes on the loops and the total numbers of loops pet genome The zigzag symbols 
demark the ends of the genetic region that is inverted in one map relative to the other (19) Boxed allele designations indicate genes 
that reside at markedly different locations in the two maps Double-headed arrows indicate different map orders for two or more 
genes in £ coli as compared with 5 typhimurium. (Genes that were marked with an asterisk in the data sources, indicating that they 
have not been precisely mapped with respect to adjacent markers, were excluded from consideration in this connection.) 
Rectangular boxes set into the E coli map designate cryptic prophages. The genes for rRNA (the mi series) have been shown to map 
at analogous locations in E coli and S. typhimurium and are omitted from this diagram All alleles that reside in map segments that 
are designated as loops are shown in the diagram, including those genes that have been only approximately mapped Effort has been 
made to display gene locations accurately; nonetheless, for definitive information, especially in crowded sections, the original 
compilations of map information should be consulted 



56 • GENOME ORGANIZATION 975 



OCkA* 
onsA 



EC 



39 



V 



zi MoH G 

ihs 



37- 
38 



ST 



(777) 



chcW.A 
molB.A 
Itol 
HbB 
uvr C 
o, MqD 
C h °9 

pj supD 



his 

g*d 

rfbA.BD 
udK 



(dtd> \ 

\cdd 

go) / 

gyiA 

VglpT 
glpA 



p la 
hisJ.P 
putF 



38H 
40 



39 



41 
43 



43 
-45 



45 
48 



46 
49 



47 
50 



ten! 

chew. A 1 R 
mot A.B 
lloE.K 

uvrC 



(oHU 
i (HI) 

\noAi« 

SupD 



fioD 



his — 
gnd 



rlb- 



glpA* 
men" 

ack" 

hUJ. P 



purF 



nupC 
xap 



EC 



pd&B 

hisT 
fobB 
oroC 

dsd 



./pisH.I 
^crr 
cysA.K 



purC 



upp 
purM 
guoA.B 

hisS 



50 

48- 
51 
52 

50 
53 



h54 
52 



ranA - 

ung 

PSS 



56 



ST 

pdiB 
h»sT 
fobB 
oroC 



hyd 

recA 
srl 
mulS 

5Uys 



eno' 
pyrG 
reiA 

fuc 

argA 

recB 
recC 
thyA 
mulH 

•62 tupK 



53- 



54 

55 



55 



58 
59- 



h59 
60 



1-60 
61 



-61 
62 



EC 



serA 




\neiK 

g'c 



orgG 



iciC* 



ozoB r\g n B , 

pogf 



\glnF" 

\ (oldA) '"PS fctf 



srl 

hyd 



mulS* j 
F 

Cy5 |H 

eno" 
pyrC 



relA 

fuc 

argA 

mulH* 

recB/C 

IhyA 

gofR > 

lys J 



rid A J 



oroE 



rpsE 



rpsU 
crp 
orgD 

cysG 



oroB 
ompR 

roolO.T 
glpR 



SupK 

FIG 1 . Continued 



63 
63 



65 
64 



ST 

serA 
roetK 

end A 
meiC 




69 
69 



70 



72 
71 



73 
72 



74 
73 



75 
74 



orgG 
nlrA 
gltB 
EH] 
orgR 

envB 



oroE 
rpsE 

rpsL 

orgD 
crp* 

cysG 
oroB 

ompB" 

molO.T 
glpR 



ACCRETIONS TO THE GENOME 

Additions and Deletions Suggested by Map 
Comparison 

E. coli and S. typhimurium loops 

By aligning the genetic maps of £. coli and S. 
typhimurium, one sees first of all that the order of 
genes on the two maps is nearly identical, but also one 
sees that there are numerous locations where genetic 
distances between corresponding genes differ. The 
excess genetic distance in one genome relative to 
another can be pictured diagrammatically as a bulge 
or "loop " The two 1983 genetic maps for £ coli and S. 
typhimurium were placed side by side and were 
aligned. Wherever corresponding genes fell greater 



than 0,6 map units distant from one another, a loop 
was created to restore alignment. The configuration 
that resulted is depicted in Fig. 1. Altogether, the £ 
coli map exhibits 14 loops and the 5. typhimurium 
map has 15 loops, signifying unequal map distances 
in5 the two maps, comprising a total of 13 6 map units 
in the aggregate out of 100 map units total for both 
organisms Assuming that map distances are propor- 
tional to lengths of DNA, it appears that almost 14% of 
the DNA of the genomes of both £. coli and S. typhi- 
murium was acquired by insertion of segments of DNA 
at the loci of the loops, or lost by deletion of segments 
of DNA from the partner genome at these same loci, or 
both According to this view, the genome of the ances- 
tor of £. coli and S. typhimurium could have been 
either larger or smaller than contemporary genomes, 
according to whether the process of insertion or dele- 
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tion predominated.. A schematic representation of the 
relationship between the two maps is shown in Fig. 2 

Acquisition by transposition 

Loops could represent DNA that was acquired from 
another genetic source and incorporated into the 
bacterial chromosome* An important mechanism for 
the acquisition from another genome is transposition. 
Perhaps at least some of the loops represent acquisi- 
tion of genetic material horizontally from another 
genome, events that took place independently in E. 
coli and S. typhimurium, involving acquisition of dif- 
ferent genetic segments from different sources and a 
different set of genes in each organism 

Inspection shows that some of the unique phenotyp- 
ic characteristics that distinguish £. coli from S. 
typhimurium are encoded by genes that lie on loops of 
DNA (Fig i), as if these segments of DNA were 
acquired physically by one genome but not by the 
other. In S. typhimurium. the genes for inositol catab- 
olism, tricarboxylic acid and tricarballylic acid trans- 
port, H2 antigen, and the hin inversion system are alt 
present in S, typhimurium, not in E. coli. All of these 
distinctive genes lie on loops. Similarly, in £ coli, 



several distinctive genes, including the supernumer- 
ary duplicate argF gene, the lac operon, and the gene 
responsible for indole production, tna, are all found on 
loops. Also, the bgl and speC genes that are present in 
some £. coli strains but not in others are on loops The 
genes for galactitol catabolism, the gar genes, lie on a 
loop. As described above, the gat genes are subject to 
replacement in some £, coli strains with genes for 
catabolism of ribitol and D-arabitoI . 

Some genes appear to have been transposed in the 
past from one chromosomal location to another in the 
sense that some genes do not occupy homologous 
locations but instead occupy different sites in present- 
day £ coli and S typhimurium genomes. In some 
cases, one of each such a pair of genes is located on a 
loop. The cod, ma, pncA, and pgi genes map differently 
in the two organisms, and in one genome the gene 
resides in a loop as if that gene had been relocated as 
a consequence of transposition of the loop. 

Essential genes on loops 

Not all loops give the appearance of being recent 
acquisitions or of being mobile elements at present. 
Some of the loops contain genes that seem to be 
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EC 




FIG. 2 Schematic representation of the relationship be- 
tween E. coli and S. typhimurium genetic maps. The aligned 
portions of the two maps are represented as a single large 
circle. Excess map distances are represented as small circles 
extruded from the large circle, with those for £. coli on the 
outside and those for S typhimurium on the inside of the 
imaginary heteroduplex. 



essential to life and therefore are not the kind of genes 
one expects to have a history of recent acquisition 
from another source Examples are the £. coli loop at 
36 min that includes the pdxH, Ipp, and tyrS genes and 
the loop at 70 min that includes the rplY and mdh 
genes. These are genes for important functions One 
would not expect such segments of DNA to have been 
acquired casually as optional additions to the ge- 
nome. Other mechanisms must have generated these 
map position disparities, 

Fine structure of loops 

According to the arbitrarily chosen rules that 
were used for the map comparison exercise, the 
smallest loop structure is 0 6 map units in size. At 
the upper end, the loops were found to be 16 map 
units in size. If a map unit is taken as 45 kb, the 
loops range in size from 27 to 72 kb. In terms of 
individual genetic events, the loops are very large 
and only coarsely defined. The gross sizes and loca- 
tions of the structures as illustrated in Fig. 1 may 
reveal neither the full genetic history nor the com- 
plete present organization of any locus, but the 
diagram does identify genetic regions that may have 
undergone complex changes in organization during 
evolutionary divergence. Identification of loop loca- 
tions can be considered simply as a guide to those 
regions of the genome that may merit close compar- 
ative study. For instance, both ends of the £. coli 
loop at 6 5 map units, the lac-argF loop, have been 



examined in some detail and have provided infor- 
mation on the fine structure of this one loop. 

The argF-lac loop seems to be about 75 kb long It is 
not a single uninterrupted structure, but instead is 
composed of more than two smaller loops The argF 
gene, at one end of the loop, is situated within a 12-kb 
transposonlike segment flanked by two IS/ elements 
(54, 122). The lac operon, at the other end of the loop, 
is situated within another 12- to 1.3-kb loop of DNA 
which has no detectable homology to S. typhimurium 
DNA and which may have been an active transposon 
in the past (16, 65) Therefore, the loops of Fig 1 may 
appear to be a single, uninterrupted, nonhomologous 
region, but molecular analysis can reveal complex fine 
structures which reflect complicated past histories 
The true size of DNA segments that have been indi- 
vidually added to or subtracted from the two genomes 
as a consequence of a single genetic event may be 
considerably smaller than the size of the loops that 
are shown in Fig. 1, and the total numbers of addition 
or deletion events may have been much larger than 
the numbers of loops in the model 

Additions Suggested by Transposonlike Structures 

DNA segments that are too small to be seen in 
mapping data in terms of the loops may have been 
acquired by the £. coli and S typhimurium genomes 
Insertion (IS) elements are not shown on the maps in 
Fig. 1 both because they are smaller than the arbitrary 
size chosen for display and because they vary in 
numbers of copies and genetic locations in different 
strains of £. coli and 5. typhimurium The number of 
copies of IS sequences in a genome is highly vari- 
able, not only from species to species but also from 
strain to strain (54, 110). IS/, for instance, appears 
in one or as many as 30 copies in various £. coli 
strains but is not present in some S typhimurium 
strains, such as LT2 (83, 84) IS5 also shows varia- 
tion in numbers and location in £. coli strains (28, 
40). The numbers of IS4 elements vary from strain 
to strain, oddly enough not independently of the 
numbers of IS5 elements. IS4 differs from the other 
IS elements in that it seems to reside within a larger 
transposon, giving it the same flanking sequences 
regardless of its chromosomal location (28). The 
proAB-lac-proC region of the £ coli K-12 genome is 
highly active in this regard in that it carries a 
disproportionately large number of IS elements, 
including IS/, IS5, IS/2/, and multiple copies of IS2 
and IS3 (111). 

The IS elements in a bacterial chromosome might 
have been acquired by transposition from infecting 
plasmids. Not only IS elements, but also functional 
genes could be acquired from visiting plasmids As 
mentioned above, the DNA segment that contains the 
operons concerned with degradation of ribitol or 
arabitol is bracketed by large, imperfect repeated 
sequences, and the supernumerary argF gene, a dupli- 
cate of argl, is flanked by two IS/ sequences, sugges- 
tive in both instances of acquisition by duplicative 
transposition The gene for a-hemolysin activity, /i/y, 
is plasmid borne in some isolates of £. coli and is 
located in the chromosome in other strains. Evidently, 
the gene has moved from one genome to the other by 
transposition (20). 
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Acquisition of Genes from Temperate Phages 

Some genes have been acquired as remnants of 
degenerate forms of temperate prophages. The pres- 
ence of cryptic prophages in the chromosome is highly 
variable among E. coli strains. Some strains off. coli 
K-12 carry cryptic lambdoid prophages at map posi- 
tions 12, 30, and 34 and cany the e!4 prophage near 
position 25 min (Fig. 1) 

Cryptic lambdoid prophages 

At the rac locus at map position 30, there are two 
remnants of a lambdoid phage which contain a re- 
combination function that can be activated by zygotic 
induction (76) and that can contribute phage recom- 
bination genes to defective superinfecting X rev phages 
(59) The rac cryptic prophage can also contribute an 
alternative replication origin, oril, to its host chromo- 
some when the bacterial origin of replication, on'C, is 
not functioning (26). 

At two separate loci there are similar cryptic lamb- 
doid prophages, one called qsr* (sometimes designated 
QSR') at map position 20 (4, 58) and another called 
qin (sometimes designated kirn) at map position 34 
(11, 33) Both of these defective prophages have the 
ability to provide phage Q, S, and R functions to 
superinfecting mutant lambdoid phages (33, 109). The 
same qsr' prophage also contributes a phage gene for 
outer membrane protein {ntnpQ to defective mutant 
bacterial hosts that lack outer membrane protein 
genes (48) A variant of the nmpC gene is also present 
in phage PA-2 and is expressed in PA-2 lysogens (101) 
Thus, both cryptic and unimpaired prophages have 
the capacity to supply phage genes to superinfecting 
phage mutants and also to supply missing cellular 
functions to bacterial mutants 

el 4 prophage 

A final example of acquiring genes from phage 
genomes is the £ coli pin gene. The genetic element 
e!4 is inducible as a 14 4-kb circle and has a specific 
attachment site in E coli chromosomes nearpurB, but 
the defective prophage is present only in some £. coli 
K-12 strains, riot in E coli C or B/5 (15, 64). The el4 
prophage contributes the pin gene to the E. coli 
chromosome (92, 93, 113). This locus encodes a spe- 
cific DNA inversion system that is similar to the S. 
typhimurium Hin system for inverting the H2 antigen 
segment in S, typhimurium and also similar to the 
related systems in phages PI and Mu (22). Apparently, 
the £. coli chromosome has acquired the pin inversion 
system by integration of prophage genes. 

Clearly some functions in the bacterial genome 
originated in phage genomes. Ironically, these genetic 
functions may be "coming home to roost," since 
phage genes themselves may have originated long ago 
as fragments of bacterial genomes (30) 

SUMMARY 

The organizations of the £. coli and 5. typhimurium 
genomes are highly similar. Gene order has been 
preserved, with a few exceptions, over a long period of 
evolutionary time Both large and small repeated 



sequences are present in the £. coli and S. typhimuri- 
um genomes at all levels of organization Intrage- 
nomic interactions of some of the repeated sequences 
can bring about genetic rearrangement. But despite 
harboring a variety of repeated sequences, the ge- 
nomes tend not to undergo rearrangements frequently 
in nature, The mechanisms for regulating and pacing 
reassor tment of bacterial genes have not been identi- 
fied, Future analysis will, no doubt, lead to the discov- 
ery of molecular mechanisms which have determined 
the organization of the genes and continue to act to 
maintain the organization of the bacterial genome. 
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